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Abstract 



We introduce a new technique to bound the asymptotic performance of splay trees. The basic idea is 
to transcribe, in an indirect fashion, the rotations performed by the splay tree as a Davenport-Schinzel 
sequence 5^ ^ none of whose subsequences are isomorphic to fixed forbidden subsequence. We direct this 
technique towards Tarjan's deque conjecture and prove that n deque operations require 0(na*(n)) time, 
where a*{n) is the minimum number of applications of the inverse- Ackermann function mapping n to 
a constant. We are optimistic that this approach could be directed towards other open conjectures on 
splay trees such as the traversal and split conjectures. 

1 Introduction 

Sleator and Tarjan proposed the splay tree ^ as a self-adjusting alternative to traditional search trees like 
red-black trees and AVL-trees. Rather than enforce some type of balance invariant, the splay tree simply 
adjusts its structure in response to the access pattern by rotating accessed elements towards the root in 
a prescribed way; see Figure [l] By letting the access pattern influence its own shape, the splay tree can 
inadvertently learn to perform optimally on a variety of access patterns. For example, the static optimality 
theorem states that splay trees are no worse than any fixed search tree. The working set, and dynamic finger 
theorems show that the access time is logarithmic in the distance to the accessed element, where distance is 
either temporal (working set [28 ) or with respect to key-space (dynamic finger [HI [7]). Sleator and Tarjan 
went a step further and conjectured that splay trees are, to within a constant factor, just as efficient as any 
dynamic binary search tree, even one that knows the whole access sequence in advance. Despite a fair amount 
of attention over the years, this dynamic optimality conjecture is still open. In fact, there is currently no 
non-trivial (i.e., sub-logarithmic) bound on the competitiveness of splay trees. The difficulty of this problem 
stems from the fact that splay trees were deliberately designed not to "play along." They do not adhere to 
any notion of good structure that we might have and, more to the point, there is no reason to believe that 
splay trees mimic the behavior of the optimal dynamic search tree. 



Figure 1: Splay trees' restructuring heuristics. After accessing an element x the tree rotates it to the root 
position by repeatedly applying a zig-zig, zig-zag, or zig as appropriate. On the left is the zig-zig case: x 
and its parent y are both left children (or both right children); the edges {y, z) and (x, y) are rotated in that 
order. On the right is the zig-zag case; the edges (x, y) and (x, z) are rotated in that order. Not depicted is 
the zig case, when y is the tree root and the edge (x, y) is rotated. 
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The renewed interest in the dynamic optimahty question is largely due to Demaine et al.'s invention of 
tango trees By appealing to the interleave lower bound of Wilbur [34 they show that tango trees are 
O (log log n) -competitive. Tango trees make use of red-black trees but it is easy to see that just about any 
standard binary search tree could be used as a black box in its place. Wang et al. [33] (see also [M]) showed 
that if splay trees are employed instead of red-black trees it is possible to have 0(loglogn)-competitiveness 
and retain some properties of splay trees, such as O(logn) amortized time per access and sequential access 
in linear time. Wang et al. also extended their data structure to handle insertions and deletions. 

If one's immediate goal is to prove that splay trees are simply o(log n)-competitive, it suffices to show 
that they run in o(n log n) time on any class of access sequences for which the optimal binary search tree 
runs in 0{n) time. There is currently no "theory" of access sequences whose inherent complexity is linear. 
It is, therefore, not too surprising that all the major open conjectures on splay trees (corollaries of dynamic 
optimality) concern sequences whose optimal complexity is linear. Whether one's goal is modest or ambitious, 
i.e., proving sub-logarithmic competitiveness or the full dynamic optimality conjecture, the first step must 
be to understand how splay trees behave on very easy access sequences. We restate below three unresolved 
conjectures on the optimality of splay trees j32| [28l \2b^ . 

Deque Conjecture Tarjan [32 conjectured that all double-ended queue operation^ (push, pop, and their 
symmetric counterparts inject and eject) take 0(1) amortized time if implemented with a splay tree. 
A push makes the root of the splay tree the right child of a new vertex and a pop splays the leftmost 
leaf to the root position and deletes it. Inject and eject are symmetric. 

Traversal Conjecture Sleator and Tarjan [28] conjectured that for two binary search trees S and T (defined 
on the same node set) accessing the elements in T by their preorder number in S takes linear time. 

Split Conjecture Lucas conjectured [25 that any sequence of splittings in a splay tree takes linear time. 
A split at X consists of splaying x to the root and deleting it, leaving two splay trees, each subject to 
more splittings. 

Sundar established a bound of 0{na{n)) on the time required to perform n deque operations, where 
a is the inverse- Acker mann function. Lucas [25^ showed that when the initial splay tree is a path (each node 
a left child), n split operations take 0{na{n)) time. Notice that the split conjecture subsumes a special 
case of the deque conjecture, where only pops and ejects are allowed. We are aware of no published work 
concerning the traversal conjecture. 

Our Contributions. We introduce a new technique in the analysis of splay trees that is fundamentally 
different from all previous work on the subject [28^, |32l |30l El [71 [T3[ . The idea is to bound the time taken 
to perform a sequence of accesses by transcribing the rotations performed by the splay tree into a Davenport- 
Schinzel sequence i.e., one avoiding a specific forbidden subsequence. We apply this technique to the deque 
problem and show that n deque operations take 0(na*(n)) time, where a*(n) is the number of applications of 
the inverse- Ackermann function mapping n down to a constant. This time bound is established by generating 
not one sequence ^ but a hierarchy of sequences, each of which avoids subsequences isomorphic to abababa. 
Nearly tight bounds on the length of such sequences were given by Agarwal, Sharir, and Shor [2^. We believe 
that a generalized version of this technique should be useful in resolving other open conjectures on splay 
trees. For instance, a particularly clean way to prove the deque, split, or traversal conjectures would be 
to transcribe their rotations as a generalized Davenport- Schinzel sequence with length 0{n). Klazar and 
Valtr [21 showed that a large family of forbidden subsequences have a linear extremal function. 

Related Work. lacono [TF defined a weaker notion of dynamic optimality called key independent optimal- 
ity. One assumes that keys are assigned to elements randomly. The optimal cost of a sequence of operations 
is the expected optimal cost over all key assignments. lacono showed that any data structure having the 
working set property is also optimal in the key independent sense. Blum et al. [5 defined another weaker 
notion of dynamic optimality called dynamic search optimality. In this cost model the (online) search tree 
can perform any number of rotations for free after each access, i.e., it only pays for actually doing searches. 

■"^Also called a deque, pronounced "deck." 
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Georgakopoulos [T3^ showed that splay trees are competitive against a large class of dynamic balanced binary 
search trees, which can be "self adjusting" in the sense that they change shape in preparation for future 
searches. lacono defined a unified property for search structures that subsumes the working set and dynamic 
finger properties. In his data structure [18 the access time for an element is logarithmic in its distance, where 
"distance" is a natural combination of temporal distance plus key-space distance. lacono's data structure 
[18] is not a binary search tree and it is currently open whether any offline binary search tree has the unified 
property. In other words, it is not known to be a corollary of dynamic optimality. 

Just after the invention of splay trees [28 , Fredman et al. [12 invented the pairing heap as a kind of 
self-adjusting version of Fibonacci heaps. There is no obvious (and still interesting) analogue of dynamic opti- 
mality for priority queues, though lacono [17] did show that pairing heaps possess an analogue of the working 
set property. See Fredman pjj and Pettie [26^ for the best lower and upper bounds on the performance of 
pairing heaps. 

List maintenance and caching algorithms (such as move-to- front or least-recently-used [27 ) are sometimes 
described as being self adjusting heuristics. In these problems the (asymptotic) dynamic optimality questions 
are pretty well understood [27 , though the leading constants have not yet been pinned down jSjH]. 

Organization. In Section [2] we describe a known reduction [32*, "30", "2F from the deque problem to a restric- 
tive system of path compressions. In Section [3] we define some notation for Davenport-Schinzel sequences, 
path compressions, and slowly growing functions. Section [4] introduces a recurrence relation for a type of 
path compression system and shows how it can be analyzed easily using bounds on Davenport-Schinzel 
sequences. Section Is] gives the proof that n deque operations take 0(na*(n)) time. 



2 Deque Operations and Path Compression Schemes 

The relationship between deque operations on a splay tree and halving path compressions on an arbitrary 
tree was noted by Lucas [24l|25] and implicitly in [32] [30]. Let us briefiy go through the steps of the reduction. 
At the beginning of a phase we divide the /c-node splay tree into left and right halves of size k/2. The phase 
ends when one half has been deleted due to pops and ejects. We ignore the right half for now and look at the 
binary tree induced by the left half; call it L and its root r. The root of this tree may or may not correspond 
to the root of the whole splay tree. In any case, we imagine rotating the nodes on the right spine across the 
root until the root has only one (left) child; call this tree L^ Finally, we transform the binary tree into a 
general tree L" as follows. The left child of a vertex in L' corresponds to its leftmost child in L" and its right 
child in corresponds with its right sibling in See Figure [2] Notice that if v and its L-parent PL{y) 
lie on the left spine of L, and Pl{v) 7^ r, rotating the edge {v^pl{v)) corresponds to making v the leftmost 
child of its grandparent in V . If Pl{v) — t rotating {v^pi^iv)) makes v the root of L but does not change 
the structure of L' or L" . In an almost symmetric way, if v,pl{v), and Pi {v) 7^ r lie on the right spine of 
L then rotating {v^pl{v)) in L corresponds to rotating {p\^\v)^pl{v)) in L' and making p\ / {v) the leftmost 

child of its grandparent {v) in L" . Notice that v = Pl'{pl{v)) = Pl' \Pl \^))- Observation 
the relationship between deque operations and path compressions; see also j25l [30]. 



2.1 



summarizes 



Observation 2.1 A pop operation corresponds to a halving path compression in L" that begins at the leftmost 
leaf and terminates at r, followed by a deletion of the leftmost leaf and a possible root relocation from r to 
its leftmost child. A push operation causes a new leaf v to be added as the leftmost child of r in L" , followed 
by a root relocation from r to v. An eject operation corresponds to a halving path compression originating 
at r and terminating at some ancestor (not necessarily the root of L" ), followed by a possible root relocation 
from r to PL"{r)- An inject operation has no effect on L" . 

It is clear that if the amortized cost per deque operation in a phase is f{k) then the overall cost for m 



deque operations on an initial n-node splay tree is 0((m+n)/(m+n)); see [32l |30l [TO] . Using Observation 2.1 
we can (and do) restrict our attention to bounding the total length of a sequence of halving path compressions 
up the left spine of an arbitrary rooted tree. However, we may still lapse into deque terminology. The terms 
"pop" and "push", for instance, are basically equivalent to "halving path compression" and "leaf addition." 
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Figure 2: Top row: the transformation from the splay tree (left nodes black, right nodes right) to L^V ^ and 
L" . Bottom: the effect of a pop on the splay tree and L" . 

Related Work. Restricted forms of path compressions have been studied in a number of situations. The 
most well known example is Hart and Sharir's result [15 on the equivalence between [ahaha)-b:ee Davenport- 
Schinzel sequences and generalized postordered path compressions; both have maximum length Q{na{n)). 
Loebl and Nesetril [22] and Lucas [24] independently proved that standard postordered path compressions 
with the so-called rising roots condition take linear time. Buchsbaum, Sundar, and Tarjan [6] generalized 
this result to what they called deque-ordered path compressions, again assuming the rising roots condition. 
Hart and Sharir [15] have conjectured that the rising roots condition is not essential and that standard 
postordered path compressions take linear time. The path compressions corresponding to deque operations 
are similar to the special cases studied earlier. Some differences are that the compressions are halving (not 
full), and although the compressions are spinal, due to pushes and ejects they are not performed in postorder 
and do not satisfy the rising roots condition. 



We say two sequences are isomorphic if they are the same up to a renaming of symbols. The relations <j ^ S 
and cr ^ S hold, respectively, when a is a subsequence of H and when a is isomorphic to a subsequence of 
H. A subsequence of S is any sequence derived by deleting symbols from H. A sequece S is called c-regular 
if any two occurrences of the same symbol appear at distance at least c. We denote by and the 
length and alphabet size of S, respectively. Following Klazar [20 we let Ex(cr, n) be the maximum length of 
cr- free sequences: 

Definition 3.1 Ex(cr, n) = max{|H| : <j 7^ S and \\T^\\ =n and S is \\a\\-regular} 

It is known [19 that Ex(cr, n) = n • 2^^^'>^^^\ where the 0(1) depends on the length and alphabet size 
of cr. Nearly tight bounds on Ex((j, n) are known [2 when <j is of the form ahaha---. Here a{n) is the 
inverse- Acker mann function, which can be defined as follows. If f{n) is a strictly decreasing function on the 
positive integers f*{n) = min{z : f^'^\n) < 2}, where f^^^^\n) = f{f^^\n)) and f^^\n) = f{n). Define 



3 Notation 
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a(m, n) = min{z > 1 : log* * ' ' ' *(n) < 2 + ^} and a{n) = a{n^ n). This definition of a differs from others 
in the literature [31125] by at most a small constant. 

In this paper all trees are rooted and the children of any vertex assigned some left-to-right order. The 
trees we deal with are occasionally restructured with path compressions. Let p{u) be the parent of u at some 
specific time. If C = (i^i, . . . ^Uk) is a path, where Ui-^i = p{ui)^ performing a total compression of C means 
to reassign p{ui) = i^/c, for all 1 < i < /c — 2. A halving compression of C sets p{ui) = for all odd 
i < k — 2. If /c is even a halving compression may set p{uk-i) = p{uk)- A specific case of interest is when 
Uk is the tree root: setting p{uk-i) = p{uk) causes Uk-i to be a tree root as well. We say that C originates 
at ui and terminates at ix/e. A total/halving path compression that does not terminate at the tree root is 
stunted. The length of a total/halving path compression is the number of vertices whose parent pointers are 
altered. We never consider compressions with zero length. 

A postordering of a tree rooted at having children vi, . . . from left to right, is the concatenation of 
the postorderings of the subtrees rooted at vi^ . . . ^Vk followed by v. The spine of a tree is the path from its 
root to its leftmost leaf. A path compression is spinal if it affects a subpath of the spine; it need not include 
the leftmost leaf nor the tree root. It is easy to see that a total/halving spinal compression can be postorder 
preserving. For a total compression {ui^ . . . , Uk) we just prepend i^i, . . . , Uk-2 to the preexisting left-right 
ordering on the children of u^- For a halving compression we make Ui the new leftmost child of Ui-^i for all 
odd i. 

In order to use a clean inductive argument we look at a restrictive type of instance called a spinal 
compression system. The initial structure consists of a single path containing a mixture of essential nodes 
and and fluff nodes. The tree is manipulated through halving spinal path compressions, leaf deletions, and 
spontaneous compressions. Let us go through each of these in turn. Whenever a fluff node becomes a leaf 
it is automatically deleted. The leftmost essential node (always a leaf) may be deleted at any time. We are 
mainly interested in the total length of the "official" path compressions, which are always halving and spinal. 
Spontaneous compressions are any postorder preserving path compressions, the cost of which we need not 
account for. Let ^{n, /, m) be the maximum total length of the official compressions on an instance with n 
essential nodes and / fluff nodes, where at most m of the compressions are stunted. In Section |4] we derive 
a recursive expression bounding ^(n, /, m) and in Section [s] we relate ^ to the time required for deque 
operations. 

4 Recursive Bounds on Spinal Compression Systems 

Consider an initial instance with n essential nodes and / fluff nodes. We first divide up the path into 
n/B blocks, each containing B essential nodes, where the bottom most node in each block is essential. 
The sequence of official compressions is partitioned into n/B epochs, where the jth epoch begins at the first 
compression when the leftmost leaf belongs to the jth block and ends at the beginning of the (j + l)th epoch. 
Let Ij be the set of nodes, not including those in the jth block, that are touched by some compression in the 
jth epoch. It follows that at the commencement of the jth epoch Ij is a single path; see Figure [3] At this 
time some members of Ij may be affiliated with previous epochs. It is always the case that nodes affiliated 
with any / < j appear as a connected subpath of the spine. We call an essential node v exposed if it has no 
essential ancestor that either shares an affiliation with v or lies in the same block as v. Let Ij C Ij be the 
set of essential exposed nodes. We call epoch j sparse if \Ij \ log \Ij\ < \Ij\ and dense otherwise. If epoch j 
is dense we affiliate all nodes in Ij with j. We view Ij as a separate spinal compression system, where Ij is 
the set of essential nodes and all others in Ij\Ij are fluff. Notice that a subpath of Ij can be affiliated with 
j and a previous epoch, say j' . If a compression influences nodes affiliated with both j and j' we charge the 
cost to the jth spinal compression system and call its effect on the j'th compression system a spontaneous 
compression. If the jth epoch is sparse we do not affiliate any nodes with j. 

Lemma 4.1 Let f > fj and m > fhj + rfij . 



n/B 



^{nJ,m) = ^^{BJj,mj)^ 



3^(|/, I, \ij I log \ijlfhj) + 2m + n + / 



epoch j dense 
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Figure 3: 



Proof: Consider a compression C occurring in the jth epoch. We can always break C into three parts: (i) 
a compression C inside the jth block followed by (ii) an edge e leading from the jth block to some vertex in 
/j, followed by (iii) a compression C" lying entirely within Ij. Notice that if C" is present it may be either 
stunted or unstunted. However, if C" is present then C (if present) must be an unstunted compression. We 
call C" short if it has unit cost, i.e., if only one node changes parent. Whether C is stunted or not its cost is 
covered in the M{B^ fji^j) term, where fj is the number of fluffy nodes and rfij the number of compressions 
that are stunted with respect to block j. Similarly, if the jth epoch is dense the cost of C" ^ whether stunted 
or not, is covered in the term \ log |/j|,mj), where fhj is the number of stunted compressions that 

terminate in the spinal compression system on Ij. We account for the cost of e in one of two ways. If C" 
(and therefore C) is stunted we charge it to the compression; hence the m term. If not, notice that after 
every unstunted compression C" the number of nodes whose grandparents are roots or nonexistent increases 
by at least one. This can obviously happen at most n + / times. Consider the compression C" when the jth 
epoch is sparse. If C" is short we charge its cost to the compression; hence the other m term. In general, 
C" will intersect one or more previously spawned spinal compression systems. That is, it can be divided 
up into pieces C'l^G^^C''^^ . . ., where for odd i, C'^ intersects an existing compression system. The costs of 
C3 , . • • are covered in the \ log m^/) terms. By doubling this sum we can account for 

some of C2 , C4 , . . .; at the very least this includes those with unit cost. Over all of epoch j, the total length 
of the other mini-compressions of the form C'^ {i even) is \ log |/j|; see Sundar [30 By tripling the sum 
\ log \Ij'\^fhj') we account for the cost of these mini-compressions as well. □ 
looks as though it may be effectively vacuous. We are trying to bound the growth of ^(n, /, m) 



Lemma 



4.1 



and all Lemma 



4.1 



can say is that it depends on the magnitude of the sets. It does not even suggest a 

trivial bound on their size. Our strategy is to transcribe the sets as a repetition- free sequence y that 

avoids a speciflc forbidden subsequence. It follows from the results of Agarwal et al. [2 that \y\ is nearly 
linear in the size of its alphabet. By choosing an appropriate block size B we can apply Lemma [4!!] to obtain 
useful bounds on 



Our transcription method for y is very similar to the one used by Hart and Sharir [15 . In Lemma 4.2 



we show that abaabba 5^ and abababa -/< 5^ . Using bounds on Ex{abababa^ •) from Agarwal, Sharir, and 



Shor [2 we are able to show, in Lemmas 4.3 and 4.4 that ^(n, /, m) is 0((n + / + m)a*{n)) 



^In Sundar 's terminology each of these mini-compressions is a fc-cascade for some k > 2. 
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Before giving the actual transcription method we give a simpler one, point out why it isn't quite what 
we need, then adjust it. The transcription is based on an evolving labeling of the nodes. A label is simply 
a sequence of block/epoch indices, listed in descending order. Since nodes can belong to several spinal 
compression systems a node may keep several labels, one for each system. At the commencement of the 
jth epoch we select out of Ij a subset Ij with several properties, one of which is that Ij has at most one 
node from any block. If the jth epoch is dense we prepend j to the label of each node in Ij . The sequence 
y consists of the concatenation of all node labels, where the nodes are given in postorder. An equivalent 
definition is that begins empty; whenever the leftmost leaf is deleted we append its label to y and 
continue. Besides having the property that abaabba^ abababa we need 5^' to be repetition- free and 

contain not too many occurrences of any one symbol, say, at most t occurrences. To enforce this, if >t 
we simply split it up into pieces and treat each piece as a distinct spinal compression system. Thus, 

the number of systems could exceed the number of blocks/epochs. There may be repetitions in 5^' but not 
too many. Note that the labels of nodes appearing in any block share no symbols in common. Therefore, 
5^ can only contain repetitions at block boundaries. By removing at most n/B — 1 symbols from we 
can eliminate all repetitions. Using the above two observations, let ^ be a repetition-free sequence derived 
from y in which no symbol occurs more than t times. 

Lemma 4.2 For a G {abaabba^abababa} , a y . 

Proof: First note that if the lemma holds for y it holds for 5^ as well. We show that for any b > 
babba y . This would prove the lemma since, for any a' G {babba^abaab} and a" G {abaabba^abababa}^ 
cr' ^ cf" . Consider the commencement of the 6th epoch. If no nodes affiliated with a appear on the path 
then all nodes labeled with b occur, in postorder, strictly before or strictly after all nodes labeled with a; 
see the left part of Figure ffl In this case baba ^ y . The more interesting case is when some interval of the 




Figure 4: 



nodes in are affiliated with a. Recall that in our transcription method only exposed nodes were labeled 
and there could be at most one exposed node in each set of nodes with a common affiliation. Thus, only 
one node affiliated with a can be labeled b. All other nodes labeled b occur strictly before or strictly after 
all nodes labeled a. Thus, after the appearance of babb in y all nodes labeled a would have been deleted. 
We conclude that babba ^ y . □ 



Lemma 4.3 incorporates the recursive characterization of M from Lemma 4.1 and the (a6a6a6a)-freeness 



of y established in Lemma 1421 



Lemma 4.3 



n/B q 

^(n, /, m) = ^ ^{B, fj.rfij) + 3 J ^{k, k log k, 2k log' + 2m + n + / 

3=1 1=1 
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where 

f > fj^ ^ > k<t, q< + f , Ei < Exiabababa, q) 

Proof: Here {li}i represent the sizes of the spawned spinal compression systems. They would correspond 
with the epochs/blocks if we did not artificially break them up to guarantee that each k < t. It follows 



that the number of spawned systems is at most q = {^^li)/t + n/B. By Lemma 4.2 we have < 
Ex[abababa^q). The term ^ {li^li log li ^ 2li log^ li) reflects the cost of the ith spawned compression system, 
with li essential nodes and at most li log li fluff nodes. By [30] the number of stunted compressions (with 
greater than unit cost) terminating in this system is at most (/^ + k log \og{li + k log li) < 2li log^ □ 

Lemma 4.4 ^(n, /, m) = 0((n + / + m)a*(n)) 

Proof: Let mm{Ex{abaabba, z), Ex{abababa^ z)} = z- P{z). From the bounds established by Hart a nd S harir 
[H] and Agarwal et al. [2] we only know that f3{z) = Q{a{z)) and 0(a(z))"'^^^ We apply Lemma 
B = t = p^{n): 

Ij < mm{Ex{abababa^q)^Ex{abaabba^q)} 



4.3 



with 



< mm{Ex{abababa, — ^ + —),Ex{abaabba, — ^ + ^)} 



(3{n) f3{n) 

Thus Ij < n/{P{n) — 1). Assume inductively that ^(n, /, m) < c{n + / + m)j{n), where j{n) is the 
minimum i such that (/3^)^*^(n) is less than some large enough constant. (We need this constant threshold 
since /3^(n) is not necessarily decreasing for very small n.) 

n/B q 

^{nj,m) < '^^{BJj,rhj)^^3^{li,li\ogli,2li\og^li)^2rn^n^f 

j=i i=i 

< c(n + / + rhj)j{p{nf) + 3c (^ ^^^^_^ (1 + log(/?^(n)) + 2 log^(/?2(n)))^ 7(/5'H) + 2m + n + / 

< c(n + / + 5];^. m^-)(7H - 1) + o{n) + 2m + n + / 

< c(n + / + m)j{n) {for c sufficiently large} 

It is easy to see that /3^(/3^(n)) < a{n) for n sufficiently large, from which it follows that 7(n) = 0(a*(n)). 
□ 

5 Upper Bounds on Deque Operations 

Theorem 5.1 Starting with an n-node splay tree, the time required for m deque operations is 0((m + 
n)a*(m + n)). 

Proof: We reduce the deque problem to a set of spinal compression systems. The main difference between 
these systems and the deque problem as a whole is that spinal compression systems do not allow general 
insertion/deletion of leaves and the initial tree is always a single path. 

We impose a linear order on all nodes that ever appear in the splay tree. A pushed node (injected node) 
precedes (proceeds) all nodes that are currently in or were in the splay tree in the past. As in the general 



"^Klazar's and Valtr 21 claimed that Ex(a6aa66a, n) = G{na{n)) could be had by fiddling with Hart and Sharir's proof [15| . 
This claim was later retracted [20j. 
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reduction from Section |2] we divide the access sequence into phases. At the beginning of each phase the 
nodes are separated into equahy sized left and right sets, say m' in each set. We partition the left set into 
blocks of size B = (3{m')'^. Let the jth period begin at the first pop of an element in the jth block and end 
when all of the jth block has been popped. Note that the pop that begins the jth period could delete any 
element from the jth block, not necessarily the first. Also note that periods are not necessarily disjoint. For 
example, just after the jth period begins we could push B elements and then perform a pop. This would 
have the effect of starting the /th period without ending the jth period, where / > j. Only one period is 
active^ namely, the one whose block contains the leftmost leaf in the tree. All other periods are on hold. 

Let Jj be the set of nodes that participate in a compression in the active part of the jth period, excluding 
those that lie in block j. Clearly the nodes in Jj lie on one path when the jth period begins. However, they 
are not necessarily contiguous. For example, if the jth period is put on hold, compressions from later periods 
could evict non-Jj nodes from the path between two Jj nodes. If we put ourselves in one period's point of 
view we can assume without loss of generality that Jj does form a contiguous segment since, by definition, 
this period never "sees" any non- Jj nodes that would expose its misbelief. 

Let &{m') be the maximum cost of deque operations (i.e., the cost of push, pop, and eject operations that 
affect the left set) when the total number of nodes is m' . After the jth period begins the jth block functions 
as a mini-deque structure and its total cost at most &{B). Until the jth period begins the block- j nodes 
may participate in compressions, the total cost of which is B. (Whenever a node is evicted from the left 
spine of the tree induced by block- j nodes it can only be seen again after the jth period begins.) The cost of 
compressions inside the {Jj}j sets can be expressed in terms of the ^ function. Again, at the commencement 
of the jth period we identify the exposed nodes Jj C Jj. We do nothing if it is sparse {\Jj \ log \Jj\ < \Jj\) 
and if it is dense, we affiliate all nodes in Jj with j and spawn a separate spinal compression system on Jj. 
The proofs of Lemmas |4.1| and |4.3| can easily be adapted to show that: 



, m' /B 

&{m')<^^{B)^0{m'^ ^ ^(|J,|,|J,|log|J,|,2|J,|log^|J,|)) 
where \Jj\ < Ex{abababa,m' /B) = {m' / B)l3{m' / B) < m' /[3{m'). A simple proof by induction (along 



the lines of Lemma 4.4) shows that &{m') = 0(m'a*(m')). □ 



6 Conclusion 

It's fair to say that previous analyses of splay trees could be characterized as using potential functions [281IT3]. 
counting arguments [32l [25] [30l [10] or a complex synthesis of the two [HI [7] . Although these techniques have 
been wildly successful in proving (or nearly proving) the corollaries of dynamic optimality, they have had 
no impact on the dynamic optimality question itself. The reason for this is probably the fact that so little 
is known about the behavior of the optimal (offline) dynamic binary search tree; see [23, 5 . It is worth 
noting that the 0(log log n)-competitiveness of tango trees and their variants [9l[33l[T4] was established not 
by comparing them against an optimal tree but one of Wilber's lower bounds on any binary search tree [34]. 
(Contrast this technique with the more direct approach used by Sleator and Tarjan [2T in their proof of the 
dynamic optimality of Move- To-Front.) 

The strategy taken in this paper is quite different from previous work and is clearly general enough to 
be applied to other open problems concerning splay trees. By transcribing the rotations performed by the 
splay tree into a Davenport- Schinzel sequence avoiding abaabba and abababa we were able to use an off-the- 
shelf result of Agarwal et al. [2 and avoid an unmanageable bookkeeping problem. One direction for future 
research is to study the relationship between splay operations, various transcription methods, and various 
forbidden subsequences. Of particular interest are those forbidden subsequences whose extremal function is 
linear since it is these that would be fit for finally settling the deque, split, and traversal conjectures. See 
Adamec et al. [1 and Klazar and Valtr |21j for a large family of linear forbidden subsequences. 
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